Nonparametric estimation of long-tailed density functions and its application to the analysis of World Wide Web traffic
Authors
Abstract
The study of WWW-traffic measurements has shown that different traffic characteristics can be modeled by long-tail distributed random variables (r.v.s). In this paper we discuss the nonparametric estimation of the probability density function of long-tailed distributions. Two nonparametric estimates are considered: a Parzen–Rosenblatt kernel estimate and a histogram with variable bin width, called a polygram. The consistency of these estimates for heavy-tailed densities is discussed. To ensure the consistency of the estimates in the metric space L1, a transformation of the initial r.v. to a new r.v. distributed on the interval [0, 1] is proposed. The proposed estimates are then applied to analyze real data of WWW-sessions. The latter are characterized by the sizes of the responses and inter-response intervals as well as the sizes and durations of sub-sessions. In this way, the effectiveness of the nonparametric procedures in comparison to parametric models of the WWW-traffic characteristics is demonstrated. © 2000 Published by Elsevier Science B.V.
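The transformation idea in the abstract can be illustrated with a minimal sketch. The abstract does not state which transformation, kernel, or bandwidth the authors use, so everything specific here is an assumption made for illustration: the map T(x) = (2/π) arctan(x), the Gaussian kernel, the fixed bandwidth 0.05, the function names, and the synthetic Pareto-type sample standing in for WWW measurements. The sketch estimates the density g of the transformed sample on [0, 1] and maps it back through f(x) = g(T(x)) T'(x).

```python
import numpy as np

def to_unit_interval(x):
    """Map x >= 0 into [0, 1] with the assumed transform T(x) = (2/pi) * arctan(x)."""
    return (2.0 / np.pi) * np.arctan(x)

def transform_jacobian(x):
    """Derivative T'(x) = 2 / (pi * (1 + x^2)), needed to map the density back."""
    return 2.0 / (np.pi * (1.0 + x ** 2))

def gaussian_kde(points, sample, bandwidth):
    """Plain Parzen-Rosenblatt estimate with a Gaussian kernel (assumed kernel choice)."""
    u = (points[:, None] - sample[None, :]) / bandwidth
    kernel = np.exp(-0.5 * u ** 2) / np.sqrt(2.0 * np.pi)
    return kernel.mean(axis=1) / bandwidth

def heavy_tail_density(x_eval, sample, bandwidth=0.05):
    """Estimate f(x) as g(T(x)) * T'(x), where g is the KDE of the transformed sample."""
    y_sample = to_unit_interval(sample)
    g = gaussian_kde(to_unit_interval(x_eval), y_sample, bandwidth)
    return g * transform_jacobian(x_eval)

# Synthetic heavy-tailed data (Lomax / Pareto II), not real WWW-session measurements.
rng = np.random.default_rng(0)
sample = rng.pareto(1.5, size=2000)
grid = np.linspace(0.0, 50.0, 2001)
density = heavy_tail_density(grid, sample)
```

Because the kernel is not corrected at the boundaries of [0, 1], some probability mass leaks outside the interval, so the estimate integrates to slightly less than one. A boundary-corrected kernel or a data-driven bandwidth would be natural refinements.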
Related resources
Statistical Topology Using the Nonparametric Density Estimation and Bootstrap Algorithm
This paper presents approximate confidence intervals for each function of parameters in a Banach space based on a bootstrap algorithm. We apply a kernel density approach to estimate the persistence landscape. In addition, we evaluate the quality distribution function estimator of random variables using integrated mean square error (IMSE). The results of simulation studies show a significant impro...
Representing a method to identify and contrast with the fraud which is created by robots for developing websites’ traffic ranking
With the expansion of the Internet and the Web, communication and information gathering between individuals has shifted from its traditional form to web sites. The World Wide Web also offers a great opportunity for businesses to improve their relationship with clients and expand their marketplace in the online world. Businesses use a criterion called traffic ranking to determine their si...
Moment Inequalities for Supremum of Empirical Processes of U-Statistic Structure and Application to Density Estimation
We derive moment inequalities for the supremum of empirical processes of U-Statistic structure and give application to kernel type density estimation and estimation of the distribution function for functions of observations.
depth-based nonparametric multivariate analysis and its application in review of new treatment methodology on osteoarthrotic
In this article, we first introduce the depth function as a function for center-outward ranking. Then we present and use the half-space, or Tukey, depth function as one of the most popular depth functions. Next, multivariate nonparametric tests for location and scale differences between two populations are expressed by ranking and by statistics based on the depth versus depth plot. Finally, accord...
Spectral Estimation of Stationary Time Series: Recent Developments
Spectral analysis considers the problem of determining (the art of recovering) the spectral content (i.e., the distribution of power over frequency) of a stationary time series from a finite set of measurements, by means of either nonparametric or parametric techniques. This paper introduces the spectral analysis problem, motivates the definition of power spectral density functions, and reviews...
Journal title:
- Perform. Eval.
Volume 42, Issue
Pages -
Publication date 2000